Dataset statistics
| Number of variables | 27 |
|---|---|
| Number of observations | 296 |
| Missing cells | 409 |
| Missing cells (%) | 5.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 75.3 KiB |
| Average record size in memory | 260.4 B |
Variable types
| NUM | 15 |
|---|---|
| BOOL | 8 |
| CAT | 3 |
| DATE | 1 |
Reproduction
| Analysis started | 2020-05-05 17:14:44.664786 |
|---|---|
| Analysis finished | 2020-05-05 17:15:20.078296 |
| Version | pandas-profiling v2.5.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
month is highly correlated with quarter and 1 other fields | High Correlation |
quarter is highly correlated with month and 1 other fields | High Correlation |
weekofyear is highly correlated with quarter and 1 other fields | High Correlation |
meanwd_udsprevisionempresa is highly correlated with meanwd_udsventa | High Correlation |
meanwd_udsventa is highly correlated with meanwd_udsprevisionempresa | High Correlation |
udsstock has 93 (31.4%) missing values | Missing |
udsventa has 61 (20.6%) missing values | Missing |
udsprevisionempresa has 82 (27.7%) missing values | Missing |
roll4wd_udsventa has 50 (16.9%) missing values | Missing |
meanwd_udsventa has 42 (14.2%) missing values | Missing |
roll4wd_udsstock has 16 (5.4%) missing values | Missing |
roll4wd_udsprevisionempresa has 65 (22.0%) missing values | Missing |
weekday has 42 (14.2%) zeros | Zeros |
sin_weekday has 42 (14.2%) zeros | Zeros |
| Distinct count | 296 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10643.0 |
|---|---|
| Minimum | 23 |
| Maximum | 21263 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 23 |
|---|---|
| 5-th percentile | 1085 |
| Q1 | 5333 |
| median | 10643 |
| Q3 | 15953 |
| 95-th percentile | 20201 |
| Maximum | 21263 |
| Range | 21240 |
| Interquartile range (IQR) | 10620 |
Descriptive statistics
| Standard deviation | 6162.628011 |
|---|---|
| Coefficient of variation (CV) | 0.5790311013 |
| Kurtosis | -1.2 |
| Mean | 10643 |
| Median Absolute Deviation (MAD) | 5328 |
| Skewness | 0 |
| Sum | 3150328 |
| Variance | 37977984 |
| Value | Count | Frequency (%) | |
| 1535 | 1 | 0.3% | |
| 19895 | 1 | 0.3% | |
| 20399 | 1 | 0.3% | |
| 2255 | 1 | 0.3% | |
| 9095 | 1 | 0.3% | |
| 18383 | 1 | 0.3% | |
| 18455 | 1 | 0.3% | |
| 18743 | 1 | 0.3% | |
| 1751 | 1 | 0.3% | |
| 8591 | 1 | 0.3% | |
| Other values (286) | 286 | 96.6% |
| Value | Count | Frequency (%) | |
| 23 | 1 | 0.3% | |
| 95 | 1 | 0.3% | |
| 167 | 1 | 0.3% | |
| 239 | 1 | 0.3% | |
| 311 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 21263 | 1 | 0.3% | |
| 21191 | 1 | 0.3% | |
| 21119 | 1 | 0.3% | |
| 21047 | 1 | 0.3% | |
| 20975 | 1 | 0.3% |
| Distinct count | 296 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| Minimum | 2019-06-05 00:00:00 |
|---|---|
| Maximum | 2020-03-26 00:00:00 |
| Distinct count | 1 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 32 |
|---|
| Value | Count | Frequency (%) | |
| 32 | 296 | 100.0% |
Length
| Max length | 2 |
|---|---|
| Mean length | 2 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 2 | 100.0% |
| Value | Count | Frequency (%) | |
| Common | 2 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 2 | 100.0% |
| Distinct count | 93 |
|---|---|
| Unique (%) | 45.8% |
| Missing | 93 |
| Missing (%) | 31.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1141.9901477832511 |
|---|---|
| Minimum | 129.0 |
| Maximum | 2416.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 129 |
|---|---|
| 5-th percentile | 483.1 |
| Q1 | 858.5 |
| median | 1124 |
| Q3 | 1401.5 |
| 95-th percentile | 1821 |
| Maximum | 2416 |
| Range | 2287 |
| Interquartile range (IQR) | 543 |
Descriptive statistics
| Standard deviation | 419.4077978 |
|---|---|
| Coefficient of variation (CV) | 0.3672604345 |
| Kurtosis | 0.2753359395 |
| Mean | 1141.990148 |
| Median Absolute Deviation (MAD) | 328.5513844 |
| Skewness | 0.3188856787 |
| Sum | 231824 |
| Variance | 175902.9009 |
| Value | Count | Frequency (%) | |
| 1511 | 6 | 2.0% | |
| 878 | 5 | 1.7% | |
| 1111 | 5 | 1.7% | |
| 710 | 5 | 1.7% | |
| 1460 | 5 | 1.7% | |
| 788 | 5 | 1.7% | |
| 1046 | 5 | 1.7% | |
| 1150 | 4 | 1.4% | |
| 840 | 4 | 1.4% | |
| 1279 | 4 | 1.4% | |
| Other values (83) | 155 | 52.4% | |
| (Missing) | 93 | 31.4% |
| Value | Count | Frequency (%) | |
| 129 | 1 | 0.3% | |
| 155 | 1 | 0.3% | |
| 323 | 1 | 0.3% | |
| 362 | 1 | 0.3% | |
| 374 | 2 | 0.7% |
| Value | Count | Frequency (%) | |
| 2416 | 1 | 0.3% | |
| 2274 | 2 | 0.7% | |
| 2219 | 1 | 0.3% | |
| 2157 | 2 | 0.7% | |
| 2080 | 1 | 0.3% |
| Distinct count | 90 |
|---|---|
| Unique (%) | 38.3% |
| Missing | 61 |
| Missing (%) | 20.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 621.9744680851064 |
|---|---|
| Minimum | 147.0 |
| Maximum | 2410.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 147 |
|---|---|
| 5-th percentile | 272 |
| Q1 | 432 |
| median | 580 |
| Q3 | 738 |
| 95-th percentile | 1082 |
| Maximum | 2410 |
| Range | 2263 |
| Interquartile range (IQR) | 306 |
Descriptive statistics
| Standard deviation | 292.6176427 |
|---|---|
| Coefficient of variation (CV) | 0.4704656826 |
| Kurtosis | 8.042008973 |
| Mean | 621.9744681 |
| Median Absolute Deviation (MAD) | 206.7211227 |
| Skewness | 2.007398432 |
| Sum | 146164 |
| Variance | 85625.08482 |
| Value | Count | Frequency (%) | |
| 688 | 8 | 2.7% | |
| 472 | 7 | 2.4% | |
| 639 | 7 | 2.4% | |
| 423 | 6 | 2.0% | |
| 442 | 6 | 2.0% | |
| 560 | 6 | 2.0% | |
| 492 | 6 | 2.0% | |
| 708 | 5 | 1.7% | |
| 511 | 5 | 1.7% | |
| 580 | 5 | 1.7% | |
| Other values (80) | 174 | 58.8% | |
| (Missing) | 61 | 20.6% |
| Value | Count | Frequency (%) | |
| 147 | 1 | 0.3% | |
| 186 | 1 | 0.3% | |
| 196 | 1 | 0.3% | |
| 206 | 1 | 0.3% | |
| 216 | 3 | 1.0% |
| Value | Count | Frequency (%) | |
| 2410 | 1 | 0.3% | |
| 1889 | 1 | 0.3% | |
| 1800 | 1 | 0.3% | |
| 1751 | 1 | 0.3% | |
| 1535 | 1 | 0.3% |
| Distinct count | 195 |
|---|---|
| Unique (%) | 91.1% |
| Missing | 82 |
| Missing (%) | 27.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3088.8271028037384 |
|---|---|
| Minimum | 51.0 |
| Maximum | 19542.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 51 |
|---|---|
| 5-th percentile | 467.6 |
| Q1 | 1456.25 |
| median | 2639.5 |
| Q3 | 3788 |
| 95-th percentile | 7415.9 |
| Maximum | 19542 |
| Range | 19491 |
| Interquartile range (IQR) | 2331.75 |
Descriptive statistics
| Standard deviation | 2519.812359 |
|---|---|
| Coefficient of variation (CV) | 0.8157829087 |
| Kurtosis | 11.80325765 |
| Mean | 3088.827103 |
| Median Absolute Deviation (MAD) | 1702.171849 |
| Skewness | 2.649602808 |
| Sum | 661009 |
| Variance | 6349454.322 |
| Value | Count | Frequency (%) | |
| 2371 | 3 | 1.0% | |
| 1389 | 2 | 0.7% | |
| 1399 | 2 | 0.7% | |
| 303 | 2 | 0.7% | |
| 3140 | 2 | 0.7% | |
| 1865 | 2 | 0.7% | |
| 3788 | 2 | 0.7% | |
| 3825 | 2 | 0.7% | |
| 4227 | 2 | 0.7% | |
| 129 | 2 | 0.7% | |
| Other values (185) | 193 | 65.2% | |
| (Missing) | 82 | 27.7% |
| Value | Count | Frequency (%) | |
| 51 | 1 | 0.3% | |
| 129 | 2 | 0.7% | |
| 159 | 1 | 0.3% | |
| 216 | 1 | 0.3% | |
| 228 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 19542 | 1 | 0.3% | |
| 16054 | 1 | 0.3% | |
| 13299 | 1 | 0.3% | |
| 9635 | 1 | 0.3% | |
| 9414 | 1 | 0.3% |
| Distinct count | 1 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 |
|---|
| Value | Count | Frequency (%) | |
| 0 | 296 | 100.0% |
festivo
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 | 8 |
| Value | Count | Frequency (%) | |
| 0 | 288 | 97.3% | |
| 1 | 8 | 2.7% |
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9966216216216215 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 42 |
| Zeros (%) | 14.2% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.997453142 |
|---|---|
| Coefficient of variation (CV) | 0.6665683542 |
| Kurtosis | -1.241520413 |
| Mean | 2.996621622 |
| Median Absolute Deviation (MAD) | 1.706560446 |
| Skewness | 0.004680305814 |
| Sum | 887 |
| Variance | 3.989819056 |
| Value | Count | Frequency (%) | |
| 3 | 43 | 14.5% | |
| 2 | 43 | 14.5% | |
| 6 | 42 | 14.2% | |
| 5 | 42 | 14.2% | |
| 4 | 42 | 14.2% | |
| 1 | 42 | 14.2% | |
| 0 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 0 | 42 | 14.2% | |
| 1 | 42 | 14.2% | |
| 2 | 43 | 14.5% | |
| 3 | 43 | 14.5% | |
| 4 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 6 | 42 | 14.2% | |
| 5 | 42 | 14.2% | |
| 4 | 42 | 14.2% | |
| 3 | 43 | 14.5% | |
| 2 | 43 | 14.5% |
| Distinct count | 4 |
|---|---|
| Unique (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 4 | |
|---|---|
| 3 | |
| 1 | |
| 2 |
| Value | Count | Frequency (%) | |
| 4 | 92 | 31.1% | |
| 3 | 92 | 31.1% | |
| 1 | 86 | 29.1% | |
| 2 | 26 | 8.8% |
Length
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| Common | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 4 | 100.0% |
| Distinct count | 10 |
|---|---|
| Unique (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.993243243243243 |
|---|---|
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 8 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 3.667533456 |
|---|---|
| Coefficient of variation (CV) | 0.5244395666 |
| Kurtosis | -1.215710455 |
| Mean | 6.993243243 |
| Median Absolute Deviation (MAD) | 3.109751644 |
| Skewness | -0.3478227975 |
| Sum | 2070 |
| Variance | 13.45080165 |
| Value | Count | Frequency (%) | |
| 12 | 31 | 10.5% | |
| 10 | 31 | 10.5% | |
| 8 | 31 | 10.5% | |
| 7 | 31 | 10.5% | |
| 1 | 31 | 10.5% | |
| 11 | 30 | 10.1% | |
| 9 | 30 | 10.1% | |
| 2 | 29 | 9.8% | |
| 6 | 26 | 8.8% | |
| 3 | 26 | 8.8% |
| Value | Count | Frequency (%) | |
| 1 | 31 | 10.5% | |
| 2 | 29 | 9.8% | |
| 3 | 26 | 8.8% | |
| 6 | 26 | 8.8% | |
| 7 | 31 | 10.5% |
| Value | Count | Frequency (%) | |
| 12 | 31 | 10.5% | |
| 11 | 30 | 10.1% | |
| 10 | 31 | 10.5% | |
| 9 | 30 | 10.1% | |
| 8 | 31 | 10.5% |
| Distinct count | 43 |
|---|---|
| Unique (%) | 14.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.469594594594593 |
|---|---|
| Minimum | 1 |
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 11 |
| median | 31 |
| Q3 | 42 |
| 95-th percentile | 50 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 31 |
Descriptive statistics
| Standard deviation | 15.97664889 |
|---|---|
| Coefficient of variation (CV) | 0.561182873 |
| Kurtosis | -1.229228509 |
| Mean | 28.46959459 |
| Median Absolute Deviation (MAD) | 13.65613587 |
| Skewness | -0.3266565044 |
| Sum | 8427 |
| Variance | 255.2533097 |
| Value | Count | Frequency (%) | |
| 52 | 7 | 2.4% | |
| 51 | 7 | 2.4% | |
| 29 | 7 | 2.4% | |
| 28 | 7 | 2.4% | |
| 27 | 7 | 2.4% | |
| 26 | 7 | 2.4% | |
| 25 | 7 | 2.4% | |
| 24 | 7 | 2.4% | |
| 12 | 7 | 2.4% | |
| 11 | 7 | 2.4% | |
| Other values (33) | 226 | 76.4% |
| Value | Count | Frequency (%) | |
| 1 | 7 | 2.4% | |
| 2 | 7 | 2.4% | |
| 3 | 7 | 2.4% | |
| 4 | 7 | 2.4% | |
| 5 | 7 | 2.4% |
| Value | Count | Frequency (%) | |
| 52 | 7 | 2.4% | |
| 51 | 7 | 2.4% | |
| 50 | 7 | 2.4% | |
| 49 | 7 | 2.4% | |
| 48 | 7 | 2.4% |
working_day
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 424.0 B |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) | |
| True | 246 | 83.1% | |
| False | 50 | 16.9% |
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.004759498821957385 |
|---|---|
| Minimum | -0.9749279121818236 |
| Maximum | 0.9749279121818236 |
| Zeros | 42 |
| Zeros (%) | 14.2% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | -0.9749279122 |
|---|---|
| 5-th percentile | -0.9749279122 |
| Q1 | -0.7818314825 |
| median | 0 |
| Q3 | 0.7818314825 |
| 95-th percentile | 0.9749279122 |
| Maximum | 0.9749279122 |
| Range | 1.949855824 |
| Interquartile range (IQR) | 1.563662965 |
Descriptive statistics
| Standard deviation | 0.7086201304 |
|---|---|
| Coefficient of variation (CV) | 148.8854514 |
| Kurtosis | -1.50521649 |
| Mean | 0.004759498822 |
| Median Absolute Deviation (MAD) | 0.6270716718 |
| Skewness | -0.0106157593 |
| Sum | 1.408811651 |
| Variance | 0.5021424891 |
| Value | Count | Frequency (%) | |
| 0.4338837391 | 43 | 14.5% | |
| 0.9749279122 | 43 | 14.5% | |
| -0.4338837391 | 42 | 14.2% | |
| -0.9749279122 | 42 | 14.2% | |
| -0.7818314825 | 42 | 14.2% | |
| 0.7818314825 | 42 | 14.2% | |
| 0 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| -0.9749279122 | 42 | 14.2% | |
| -0.7818314825 | 42 | 14.2% | |
| -0.4338837391 | 42 | 14.2% | |
| 0 | 42 | 14.2% | |
| 0.4338837391 | 43 | 14.5% |
| Value | Count | Frequency (%) | |
| 0.9749279122 | 43 | 14.5% | |
| 0.7818314825 | 42 | 14.2% | |
| 0.4338837391 | 43 | 14.5% | |
| 0 | 42 | 14.2% | |
| -0.4338837391 | 42 | 14.2% |
cos_weekday
Real number (ℝ)
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.0037955736549281846 |
|---|---|
| Minimum | -0.9009688679024191 |
| Maximum | 1.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | -0.9009688679 |
|---|---|
| 5-th percentile | -0.9009688679 |
| Q1 | -0.9009688679 |
| median | -0.222520934 |
| Q3 | 0.6234898019 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1.900968868 |
| Interquartile range (IQR) | 1.52445867 |
Descriptive statistics
| Standard deviation | 0.7079619739 |
|---|---|
| Coefficient of variation (CV) | -186.5230498 |
| Kurtosis | -1.503349059 |
| Mean | -0.003795573655 |
| Median Absolute Deviation (MAD) | 0.6408877408 |
| Skewness | 0.009053080122 |
| Sum | -1.123489802 |
| Variance | 0.5012101565 |
| Value | Count | Frequency (%) | |
| -0.222520934 | 43 | 14.5% | |
| -0.9009688679 | 43 | 14.5% | |
| -0.222520934 | 42 | 14.2% | |
| -0.9009688679 | 42 | 14.2% | |
| 0.6234898019 | 42 | 14.2% | |
| 1 | 42 | 14.2% | |
| 0.6234898019 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| -0.9009688679 | 42 | 14.2% | |
| -0.9009688679 | 43 | 14.5% | |
| -0.222520934 | 42 | 14.2% | |
| -0.222520934 | 43 | 14.5% | |
| 0.6234898019 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 1 | 42 | 14.2% | |
| 0.6234898019 | 42 | 14.2% | |
| 0.6234898019 | 42 | 14.2% | |
| -0.222520934 | 43 | 14.5% | |
| -0.222520934 | 42 | 14.2% |
is_august
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 | 31 |
| Value | Count | Frequency (%) | |
| 0 | 265 | 89.5% | |
| 1 | 31 | 10.5% |
spring
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 | 5 |
| Value | Count | Frequency (%) | |
| 0 | 291 | 98.3% | |
| 1 | 5 | 1.7% |
summer
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 188 | 63.5% | |
| 1 | 108 | 36.5% |
autumn
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 206 | 69.6% | |
| 1 | 90 | 30.4% |
winter
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 200 | 67.6% | |
| 1 | 96 | 32.4% |
stockMissingType
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 2 | |
| 1 | 13 |
| Value | Count | Frequency (%) | |
| 0 | 203 | 68.6% | |
| 2 | 80 | 27.0% | |
| 1 | 13 | 4.4% |
Length
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 3 | 75.0% | |
| Other_Punctuation | 1 | 25.0% |
| Value | Count | Frequency (%) | |
| Common | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 4 | 100.0% |
| Distinct count | 235 |
|---|---|
| Unique (%) | 95.5% |
| Missing | 50 |
| Missing (%) | 16.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 610.2785859465738 |
|---|---|
| Minimum | 211.71428571428572 |
| Maximum | 1294.875 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 211.7142857 |
|---|---|
| 5-th percentile | 305.125 |
| Q1 | 484 |
| median | 589.5 |
| Q3 | 726.15625 |
| 95-th percentile | 969.6 |
| Maximum | 1294.875 |
| Range | 1083.160714 |
| Interquartile range (IQR) | 242.15625 |
Descriptive statistics
| Standard deviation | 196.3305705 |
|---|---|
| Coefficient of variation (CV) | 0.3217064715 |
| Kurtosis | 0.2274704857 |
| Mean | 610.2785859 |
| Median Absolute Deviation (MAD) | 152.9350335 |
| Skewness | 0.5366137587 |
| Sum | 150128.5321 |
| Variance | 38545.69292 |
| Value | Count | Frequency (%) | |
| 503.875 | 3 | 1.0% | |
| 560 | 2 | 0.7% | |
| 451 | 2 | 0.7% | |
| 913.625 | 2 | 0.7% | |
| 601.125 | 2 | 0.7% | |
| 692 | 2 | 0.7% | |
| 564 | 2 | 0.7% | |
| 545.625 | 2 | 0.7% | |
| 352.5 | 2 | 0.7% | |
| 524.875 | 2 | 0.7% | |
| Other values (225) | 225 | 76.0% | |
| (Missing) | 50 | 16.9% |
| Value | Count | Frequency (%) | |
| 211.7142857 | 1 | 0.3% | |
| 226.875 | 1 | 0.3% | |
| 265.25 | 1 | 0.3% | |
| 273 | 1 | 0.3% | |
| 275 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 1294.875 | 1 | 0.3% | |
| 1133 | 1 | 0.3% | |
| 1105.857143 | 1 | 0.3% | |
| 1101.6 | 1 | 0.3% | |
| 1073.571429 | 1 | 0.3% |
| Distinct count | 6 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 42 |
| Missing (%) | 14.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 623.4713016078018 |
|---|---|
| Minimum | 409.2368421052632 |
| Maximum | 877.1025641025641 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 409.2368421 |
|---|---|
| 5-th percentile | 409.2368421 |
| Q1 | 500.925 |
| median | 602.5789474 |
| Q3 | 750.3 |
| 95-th percentile | 877.1025641 |
| Maximum | 877.1025641 |
| Range | 467.865722 |
| Interquartile range (IQR) | 249.375 |
Descriptive statistics
| Standard deviation | 153.9666295 |
|---|---|
| Coefficient of variation (CV) | 0.246950628 |
| Kurtosis | -0.9983662055 |
| Mean | 623.4713016 |
| Median Absolute Deviation (MAD) | 126.8200556 |
| Skewness | 0.3074290608 |
| Sum | 158361.7106 |
| Variance | 23705.72299 |
| Value | Count | Frequency (%) | |
| 602.5789474 | 43 | 14.5% | |
| 750.3 | 43 | 14.5% | |
| 598.1621622 | 42 | 14.2% | |
| 500.925 | 42 | 14.2% | |
| 409.2368421 | 42 | 14.2% | |
| 877.1025641 | 42 | 14.2% | |
| (Missing) | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 409.2368421 | 42 | 14.2% | |
| 500.925 | 42 | 14.2% | |
| 598.1621622 | 42 | 14.2% | |
| 602.5789474 | 43 | 14.5% | |
| 750.3 | 43 | 14.5% |
| Value | Count | Frequency (%) | |
| 877.1025641 | 42 | 14.2% | |
| 750.3 | 43 | 14.5% | |
| 602.5789474 | 43 | 14.5% | |
| 598.1621622 | 42 | 14.2% | |
| 500.925 | 42 | 14.2% |
| Distinct count | 240 |
|---|---|
| Unique (%) | 85.7% |
| Missing | 16 |
| Missing (%) | 5.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1154.8951913265305 |
|---|---|
| Minimum | 362.0 |
| Maximum | 2416.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 362 |
|---|---|
| 5-th percentile | 593.16 |
| Q1 | 899.35625 |
| median | 1130.964286 |
| Q3 | 1390.325 |
| 95-th percentile | 1785.471429 |
| Maximum | 2416 |
| Range | 2054 |
| Interquartile range (IQR) | 490.96875 |
Descriptive statistics
| Standard deviation | 368.2275653 |
|---|---|
| Coefficient of variation (CV) | 0.3188406776 |
| Kurtosis | 0.2908491158 |
| Mean | 1154.895191 |
| Median Absolute Deviation (MAD) | 289.5266522 |
| Skewness | 0.4576719462 |
| Sum | 323370.6536 |
| Variance | 135591.5399 |
| Value | Count | Frequency (%) | |
| 710 | 10 | 3.4% | |
| 1072 | 5 | 1.7% | |
| 1279 | 4 | 1.4% | |
| 1382 | 3 | 1.0% | |
| 2157 | 3 | 1.0% | |
| 1414.625 | 2 | 0.7% | |
| 1460 | 2 | 0.7% | |
| 1925 | 2 | 0.7% | |
| 1159.25 | 2 | 0.7% | |
| 947.625 | 2 | 0.7% | |
| Other values (230) | 245 | 82.8% | |
| (Missing) | 16 | 5.4% |
| Value | Count | Frequency (%) | |
| 362 | 1 | 0.3% | |
| 452.25 | 1 | 0.3% | |
| 462 | 1 | 0.3% | |
| 468.25 | 1 | 0.3% | |
| 478 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 2416 | 1 | 0.3% | |
| 2219 | 1 | 0.3% | |
| 2157 | 3 | 1.0% | |
| 2093 | 1 | 0.3% | |
| 2042 | 1 | 0.3% |
meanwd_udsstock
Real number (ℝ≥0)
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1126.9416993751045 |
|---|---|
| Minimum | 866.375 |
| Maximum | 1385.392857142857 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 866.375 |
|---|---|
| 5-th percentile | 866.375 |
| Q1 | 886.6923077 |
| median | 1113.433333 |
| Q3 | 1366.7 |
| 95-th percentile | 1385.392857 |
| Maximum | 1385.392857 |
| Range | 519.0178571 |
| Interquartile range (IQR) | 480.0076923 |
Descriptive statistics
| Standard deviation | 198.2898471 |
|---|---|
| Coefficient of variation (CV) | 0.1759539533 |
| Kurtosis | -1.518103355 |
| Mean | 1126.941699 |
| Median Absolute Deviation (MAD) | 176.0370739 |
| Skewness | 0.003869696391 |
| Sum | 333574.743 |
| Variance | 39318.86348 |
| Value | Count | Frequency (%) | |
| 1366.7 | 43 | 14.5% | |
| 1113.433333 | 43 | 14.5% | |
| 866.375 | 42 | 14.2% | |
| 1021.266667 | 42 | 14.2% | |
| 886.6923077 | 42 | 14.2% | |
| 1243.344828 | 42 | 14.2% | |
| 1385.392857 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 866.375 | 42 | 14.2% | |
| 886.6923077 | 42 | 14.2% | |
| 1021.266667 | 42 | 14.2% | |
| 1113.433333 | 43 | 14.5% | |
| 1243.344828 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 1385.392857 | 42 | 14.2% | |
| 1366.7 | 43 | 14.5% | |
| 1243.344828 | 42 | 14.2% | |
| 1113.433333 | 43 | 14.5% | |
| 1021.266667 | 42 | 14.2% |
| Distinct count | 229 |
|---|---|
| Unique (%) | 99.1% |
| Missing | 65 |
| Missing (%) | 22.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3171.4648577612866 |
|---|---|
| Minimum | 51.0 |
| Maximum | 19542.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 51 |
|---|---|
| 5-th percentile | 280.1875 |
| Q1 | 1636.4375 |
| median | 2602.75 |
| Q3 | 3706.25 |
| 95-th percentile | 7442.625 |
| Maximum | 19542 |
| Range | 19491 |
| Interquartile range (IQR) | 2069.8125 |
Descriptive statistics
| Standard deviation | 2660.373793 |
|---|---|
| Coefficient of variation (CV) | 0.8388470037 |
| Kurtosis | 10.90249732 |
| Mean | 3171.464858 |
| Median Absolute Deviation (MAD) | 1703.435039 |
| Skewness | 2.750750737 |
| Sum | 732608.3821 |
| Variance | 7077588.721 |
| Value | Count | Frequency (%) | |
| 216 | 2 | 0.7% | |
| 129 | 2 | 0.7% | |
| 4023 | 1 | 0.3% | |
| 2516 | 1 | 0.3% | |
| 1811 | 1 | 0.3% | |
| 2940 | 1 | 0.3% | |
| 2882.25 | 1 | 0.3% | |
| 1914.571429 | 1 | 0.3% | |
| 1408.875 | 1 | 0.3% | |
| 1678 | 1 | 0.3% | |
| Other values (219) | 219 | 74.0% | |
| (Missing) | 65 | 22.0% |
| Value | Count | Frequency (%) | |
| 51 | 1 | 0.3% | |
| 100 | 1 | 0.3% | |
| 129 | 2 | 0.7% | |
| 153.8571429 | 1 | 0.3% | |
| 159 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 19542 | 1 | 0.3% | |
| 16054 | 1 | 0.3% | |
| 15398.75 | 1 | 0.3% | |
| 13299 | 1 | 0.3% | |
| 13239.25 | 1 | 0.3% |
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2516.6168657385765 |
|---|---|
| Minimum | 216.0 |
| Maximum | 5358.631578947368 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 216 |
|---|---|
| 5-th percentile | 216 |
| Q1 | 1057.454545 |
| median | 2249.763158 |
| Q3 | 3787.487179 |
| 95-th percentile | 5358.631579 |
| Maximum | 5358.631579 |
| Range | 5142.631579 |
| Interquartile range (IQR) | 2730.032634 |
Descriptive statistics
| Standard deviation | 1577.100626 |
|---|---|
| Coefficient of variation (CV) | 0.6266749014 |
| Kurtosis | -0.6704399884 |
| Mean | 2516.616866 |
| Median Absolute Deviation (MAD) | 1257.793139 |
| Skewness | 0.3548348885 |
| Sum | 744918.5923 |
| Variance | 2487246.385 |
| Value | Count | Frequency (%) | |
| 2798.973684 | 43 | 14.5% | |
| 3787.487179 | 43 | 14.5% | |
| 1057.454545 | 42 | 14.2% | |
| 2111.026316 | 42 | 14.2% | |
| 2249.763158 | 42 | 14.2% | |
| 5358.631579 | 42 | 14.2% | |
| 216 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 216 | 42 | 14.2% | |
| 1057.454545 | 42 | 14.2% | |
| 2111.026316 | 42 | 14.2% | |
| 2249.763158 | 42 | 14.2% | |
| 2798.973684 | 43 | 14.5% |
| Value | Count | Frequency (%) | |
| 5358.631579 | 42 | 14.2% | |
| 3787.487179 | 43 | 14.5% | |
| 2798.973684 | 43 | 14.5% | |
| 2249.763158 | 42 | 14.2% | |
| 2111.026316 | 42 | 14.2% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
First rows
| df_index | fecha | producto | udsstock | udsventa | udsprevisionempresa | promo | festivo | weekday | quarter | month | weekofyear | working_day | sin_weekday | cos_weekday | is_august | spring | summer | autumn | winter | stockMissingType | roll4wd_udsventa | meanwd_udsventa | roll4wd_udsstock | meanwd_udsstock | roll4wd_udsprevisionempresa | meanwd_udsprevisionempresa | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 23 | 2019-06-05 | 32 | 478.0 | 560.0 | 13299.0 | 0.0 | 0.0 | 2 | 2 | 6 | 23 | True | 0.974928 | -0.222521 | 0 | 0 | 1 | 0 | 0 | 0.0 | 560.0 | 602.578947 | 478.0 | 1113.433333 | 13299.00 | 2798.973684 |
| 1 | 95 | 2019-06-06 | 32 | NaN | 806.0 | 19542.0 | 0.0 | 0.0 | 3 | 2 | 6 | 23 | True | 0.433884 | -0.900969 | 0 | 0 | 1 | 0 | 0 | 2.0 | 806.0 | 750.300000 | NaN | 1366.700000 | 19542.00 | 3787.487179 |
| 2 | 167 | 2019-06-07 | 32 | NaN | 856.0 | 16054.0 | 0.0 | 0.0 | 4 | 2 | 6 | 23 | True | -0.433884 | -0.900969 | 0 | 0 | 1 | 0 | 0 | 2.0 | 856.0 | 877.102564 | NaN | 1243.344828 | 16054.00 | 5358.631579 |
| 3 | 239 | 2019-06-08 | 32 | NaN | 275.0 | 3572.0 | 0.0 | 0.0 | 5 | 2 | 6 | 23 | True | -0.974928 | -0.222521 | 0 | 0 | 1 | 0 | 0 | 2.0 | 275.0 | 409.236842 | NaN | 1385.392857 | 3572.00 | 1057.454545 |
| 4 | 311 | 2019-06-09 | 32 | NaN | NaN | NaN | 0.0 | 0.0 | 6 | 2 | 6 | 23 | False | -0.781831 | 0.623490 | 0 | 0 | 1 | 0 | 0 | 2.0 | NaN | NaN | NaN | 866.375000 | NaN | 216.000000 |
| 5 | 383 | 2019-06-10 | 32 | NaN | 492.0 | 4646.0 | 0.0 | 0.0 | 0 | 2 | 6 | 24 | True | 0.000000 | 1.000000 | 0 | 0 | 1 | 0 | 0 | 2.0 | 492.0 | 598.162162 | NaN | 886.692308 | 4646.00 | 2249.763158 |
| 6 | 455 | 2019-06-11 | 32 | 2157.0 | 570.0 | 3438.0 | 0.0 | 0.0 | 1 | 2 | 6 | 24 | True | 0.781831 | 0.623490 | 0 | 0 | 1 | 0 | 0 | 0.0 | 570.0 | 500.925000 | 2157.0 | 1021.266667 | 3438.00 | 2111.026316 |
| 7 | 527 | 2019-06-12 | 32 | 1834.0 | 688.0 | 1448.0 | 0.0 | 0.0 | 2 | 2 | 6 | 24 | True | 0.974928 | -0.222521 | 0 | 0 | 1 | 0 | 0 | 0.0 | 592.0 | 602.578947 | 817.0 | 1113.433333 | 10336.25 | 2798.973684 |
| 8 | 599 | 2019-06-13 | 32 | 362.0 | 590.0 | 2969.0 | 0.0 | 0.0 | 3 | 2 | 6 | 24 | True | 0.433884 | -0.900969 | 0 | 0 | 1 | 0 | 0 | 0.0 | 752.0 | 750.300000 | 362.0 | 1366.700000 | 15398.75 | 3787.487179 |
| 9 | 671 | 2019-06-14 | 32 | 1382.0 | 1092.0 | 4795.0 | 0.0 | 0.0 | 4 | 2 | 6 | 24 | True | -0.433884 | -0.900969 | 0 | 0 | 1 | 0 | 0 | 0.0 | 915.0 | 877.102564 | 1382.0 | 1243.344828 | 13239.25 | 5358.631579 |
Last rows
| df_index | fecha | producto | udsstock | udsventa | udsprevisionempresa | promo | festivo | weekday | quarter | month | weekofyear | working_day | sin_weekday | cos_weekday | is_august | spring | summer | autumn | winter | stockMissingType | roll4wd_udsventa | meanwd_udsventa | roll4wd_udsstock | meanwd_udsstock | roll4wd_udsprevisionempresa | meanwd_udsprevisionempresa | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 286 | 20615 | 2020-03-17 | 32 | NaN | 1800.0 | 1810.0 | 0.0 | 0.0 | 1 | 1 | 3 | 12 | True | 0.781831 | 0.623490 | 0 | 0 | 0 | 0 | 1 | 2.0 | 554.125000 | 500.925000 | 871.428571 | 1021.266667 | 2910.125 | 2111.026316 |
| 287 | 20687 | 2020-03-18 | 32 | NaN | 600.0 | 3089.0 | 0.0 | 0.0 | 2 | 1 | 3 | 12 | True | 0.974928 | -0.222521 | 0 | 0 | 0 | 0 | 1 | 2.0 | 490.250000 | 602.578947 | 1178.750000 | 1113.433333 | 3965.000 | 2798.973684 |
| 288 | 20759 | 2020-03-19 | 32 | NaN | 1889.0 | 3140.0 | 0.0 | 0.0 | 3 | 1 | 3 | 12 | True | 0.433884 | -0.900969 | 0 | 0 | 0 | 0 | 1 | 2.0 | 727.750000 | 750.300000 | 1460.000000 | 1366.700000 | 4681.375 | 3787.487179 |
| 289 | 20831 | 2020-03-20 | 32 | 581.0 | 2410.0 | 3776.0 | 0.0 | 0.0 | 4 | 1 | 3 | 12 | True | -0.433884 | -0.900969 | 0 | 0 | 0 | 0 | 1 | 0.0 | 1294.875000 | 877.102564 | 839.500000 | 1243.344828 | 7687.500 | 5358.631579 |
| 290 | 20903 | 2020-03-21 | 32 | 788.0 | 403.0 | NaN | 0.0 | 0.0 | 5 | 1 | 3 | 12 | True | -0.974928 | -0.222521 | 0 | 0 | 0 | 0 | 1 | 0.0 | 852.000000 | 409.236842 | 837.000000 | 1385.392857 | NaN | 1057.454545 |
| 291 | 20975 | 2020-03-22 | 32 | 1137.0 | NaN | NaN | 0.0 | 0.0 | 6 | 1 | 3 | 12 | False | -0.781831 | 0.623490 | 0 | 1 | 0 | 0 | 1 | 0.0 | NaN | NaN | 593.800000 | 866.375000 | NaN | 216.000000 |
| 292 | 21047 | 2020-03-23 | 32 | 1137.0 | NaN | 1389.0 | 0.0 | 0.0 | 0 | 1 | 3 | 13 | True | 0.000000 | 1.000000 | 0 | 1 | 0 | 0 | 1 | 0.0 | 565.857143 | 598.162162 | 593.800000 | 886.692308 | 2296.750 | 2249.763158 |
| 293 | 21119 | 2020-03-24 | 32 | 384.0 | NaN | 452.0 | 0.0 | 0.0 | 1 | 1 | 3 | 13 | True | 0.781831 | 0.623490 | 0 | 1 | 0 | 0 | 1 | 0.0 | 929.857143 | 500.925000 | 770.000000 | 1021.266667 | 2238.125 | 2111.026316 |
| 294 | 21191 | 2020-03-25 | 32 | 905.0 | NaN | 1818.0 | 0.0 | 0.0 | 2 | 1 | 3 | 13 | True | 0.974928 | -0.222521 | 0 | 1 | 0 | 0 | 1 | 0.0 | 550.571429 | 602.578947 | 1072.500000 | 1113.433333 | 3365.000 | 2798.973684 |
| 295 | 21263 | 2020-03-26 | 32 | 1279.0 | NaN | 708.0 | 0.0 | 0.0 | 3 | 1 | 3 | 13 | True | 0.433884 | -0.900969 | 0 | 1 | 0 | 0 | 1 | 0.0 | 1063.857143 | 750.300000 | 1279.000000 | 1366.700000 | 3582.000 | 3787.487179 |